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Persistence,  Runs,  and  Recurrence  of  Visibility 


I.  INTRODUCTION 

Duration,  persistence,  runs,  and  recurrence  are  all  interrelated,  l'o r this 
study,  they  have  been  defined  as  follows:  duration-continuous  successes;  per- 
sistence-consecutive successes  separated  by  1 hour;  runa-eonsecutive  successes 
separated  by  intervals  of  1 hour  beginning  and  ending  with  a failure;  and  recurrence- 
successes  occurring  at  time  t and  also  at  time  t+x  hours 

This  study  is  part  of  a more  comprehensive  investigation  conducted  to  obtain  a 
better  understanding  of  persistence,  runs,  and  recurrence  of  weather  events.  Dura- 
tion could  not  be  studied  because  the  data  were  observed  at  hourly  intervals.  Of 
major  interest  are  those  weather  events  which  are  usually  recorded  in  categories, 
for  example;  precipitation  recorded  as  none,  light,  moderate,  or  heavy;  or  sky 
cover  recorded  as  clear,  scattered,  broken,  or  overcast.  Persistence,  runs,  and 
recurrence  of  precipitation  and  sky  cover  are  described  m papers  by  l.und  and 
Grantham.  ' * ” 

Phis  report  includes  tables  of  observed  relative  frequencies  of  four  visibility 
categories  and  models  for  estimating  probabilities  of  each  category.  The  models 

(Keceived  for  publication  31  January  1978) 

1.  l.und,  1,  A. , and  Grantham,  1).  D,  ( 1977a)  Persistence,  runs,  and  recurrence 

of  precipitation.  J.  Appl,  Meteor.  16:346-3511. 

2.  l.und,  l.  A. , and  Grantham,  D.  D.  11977b)  Persistence,  Kvws.  and  Recurrence 

of  Sky  Cover.  KKP  No.  621.  APGL-TK-77-03O8. 
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provide  answers  to  such  questions  as:  What  is  the  probability  of  observing  a 
sequence  of  more  than  5 hours  of  visibilities  less  than  or  equal  to  3 utiles  (LK3); 
of  observing  a run  of  exactly  ft  hours  of  I.K3;  and  of  observing  1.K3  at  time  t and 
also  at  time  t+5  hours'*  The  models  require  a knowledge  of  the  unconditional 
probability  of  the  event,  in  this  case  a visibility  category,  and  a measure  of  the 
temporal  correlation  between  occurrences  of  visibility  categories. 


2.  DATA 

Records  of  hourly  visibility  observations  taken  in  winter  (December,  January. 
February)  and  summer  (June.  July,  August)  during  the  13-year  period  1951  through 
1963,  at  the  following  nine  stations,  shown  in  Figure  1.  were  studied: 

LG  A LaGuardia  Airport,  New  York,  NY 

JFK  Kennedy  International  Airport.  New  York.  NY 

FWK  Newark  Airport.  NJ 

PHL  Philadelphia  International  Airport.  PA 

HAL  Haltimore-W  ashmgton  International  Airport.  MU 

PGA  National  Airport.  Y\  ■shington.  DG 

A LAV  Andrews  A F B.  MU 

RIG  Byrd  Field,  Richmond,  YA 

RDU  Raleigh-Durham  Airport,  NG 


Figure  1.  Location  of  the  Nine 
Stations  Whose  W inter  and  Summer 
Hourly  Observations  of  \ isibiUty 
W ere  Studied 
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Each  hour,  approximately  on  the  hour,  a weather  observer  at  each  of  the  above 

stations  went  outdoors  to  make  a regular  hourly  observation.  One  of  the  weather 

3 

elements  recorded  is  visibility.  The  Federal  Meteorological  Handbook  describes 
how  the  observations  are  taken.  The  four  visibility  categories  studied  are  shown 
m Table  1.  The  abbreviations  LK  and  UK  Stand  for  "less  than  or  equal  to"  and 
"greater  than  or  equal  to",  respectively,  l'he  two  ends  of  the  frequency  distribu- 
tion were  studied  separately  to  determine  whether  temporal  correlation  is  a function 
of  visibility. 


Table  1.  Visibility  Categories 


Category 

\ lsibility  (miles) 

1 

UK  10.  0 

•> 

UK  5.  0 

3 

l.K  3.  0 

4 

LK  0.  25  (\\  inter) 

LK  1.0  (Summer) 

:i.  liVTM’Koc.issiM; 

Each  hourly  visibility  observation  was  categorized  as  follows:  UK  10  miles; 

UK  5 miles;  LK  3 miles;  LK  l mile  (in  summer),  or  1.K  0.  25  mile  (in  winter).  Some 
of  the  stations  had  no  missing  observations,  others  only  a very  few  . These  tew 
observations  were  filled  in  by  estimating  the  visibility  from  observations  taken  at 
nearby  stations  and  observations  taken  before  and  after  the  missing  observations. 
There  were  28,  080  )(24  observations /day)  X (90  days,  season)  X (13  seasons)] 
observations,  in  winter,  and  28,70-1  [(24  observations /day)  X \92  days  season)  x 
(13  seasons))  observations,  in  summer,  processed  for  each  station. 


4.  PTHSISTK.MI 
4.1  tthdfnfd 

The  occurrence  of  a given  visibility  category  was  denoted  as  a success.  S,  and 
non-occurrence  as  a failure,  F.  The  relative  frequency  ot  one  success.  111- (S  ', 
is  found  from  the  data  bv  dividing  the  number  of  times  the  visibility  category 


3.  I . S.  Department  of  Commerce  (1975)  Federal  Meteorological  Handbook  No.  1, 
Surface  Observations.  U.S.  Uovernment  Printing  Office.  W ashington.  DC 

303  pp" 
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occurred,  n(S^),  by  the  sample  size  N.  The  relative  frequency  of  two  successes 
in  a row,  RFiSg).  is  found  from  the  data  by  dividing  the  number  of  times  a success 
was  followed  by  a success,  niSg).  by  the  sample  size  N minus  the  end  effect,  in 
this  case  13,  because  there  were  13  years  when  the  next  season's  data  were  not 
used  to  determine  the  visibility  on  the  first  hour  of  the  next  season.  The  relative 
frequency  of  x successes  in  a row  RF(Sx>,  is  found  by  dividing  the  number  of  tim«.s 
x consecutive  successes  was  observed,  n(S  ),  by  the  sample  size,  N,  minus  the 
end  effects,  in  this  case  13(x-l). 

n(S  ) n(S  ) 

RF(Sx)  - N-13(x':'l)  * ~N  ‘ {1) 


This  processing  of  the  data  was  done  for  all  categories  for  all  nine  stations  in  both 
winter  and  summer. 

The  relative  frequency  of  a success  given  that  x consecutive  successes  have 
occurred,  RF(s|Sx>,  is  equal  to  the  relative  frequency  of  x+1  consecutive  successes, 
RF(Sx+^)  divided  by  the  relative  frequency  of  x consecutive  successes,  RF(Sx>, 
that  is. 


RF(S|Sx>  = 


RF(Sx+l> 

RP(5x>  • 


(2) 


I 

The  conditional  relative  frequencies  RFfS'S^)  were  computed  for  periods  up  to 
7 2 hours.  Selected  \alues  for  the  first  18  hours  are  shown  for  all  nine  stations  and 
all  four  visibility  categories  in  Tables  2,  3,  4,  and  5.  The  median  relative  fre- 
quencies are  indicated  with  asterisks. 

The  first  column  in  each  of  the  tables  gives  RF(s|So)  which  is  defined  as  RF(S), 
the  unconditional  relative  frequency  of  the  given  visibility  category.  Although  both 
the  unconditional  and  conditional  relative  frequencies  vary  from  station  to  station, 
there  is  often  no  consistent  pattern  to  the  variations.  It  was  subjectively  decided 
to  assume  that  the  data  from  all  stations  were  drawn  from  the  same  sample  and  to 
use  the  median  values  to  obtain  estimates  of  the  conditional  probabilities,  £(s|Sx), 
required  for  obtaining  estimates  of  probabilities  of  x+1  consecutive  successes. 


12 


O'" ‘ *t  *.*>  %n  *t> 

» » * j.  ^ » ,?>  » 


SinifO^  Nmnn 
iHCOO-DOi' 

» » * v»)  * Ji  ® j>  # 


« «t>  >f  A tO(-  >o  «t  *» 
OQCO>3)C1>OOCO<7)t'> 
OOdOOOJOc3>«?>{OcOX> 


|»  I ' J>  Jl  O O 4j  I' 
0O00SXIiS>i}>3O0pJ} 


>1  -«  w*  — O •• ■■•  O »r>  O 
M-DJiOO^JOh 
# 30  JO  # C*i  J>  » » # 


O i”  Jl  l"  » O O l-  N 

iO  ifih  >»  o <0  i ■ -o 

JO  JO  » JO  CO  *>  JO  » V 


§§g  £ 

*ft  I-  if)  IO 


8n  j o <7>  -•  in 

00  O iA  10  I-  «- 

ia  10  i«  ••  r-  »r>  to 


§§§oi§ 


I'.'ii-  »r  o o p o 
<o«r^»-*r-0000 

<0<0l0l*l'iAlAtAiA 


ro  »,->  o O «r  o o »r>  »r> 

C*">  CO  U"?  »A>  (?>  W>  ll"5  CM 

» I-  I-  I-  I-  •-  I-  I-  \0 


<*'  I-  I-  <£>  O 
6«Ni-  o. 


Om«t'iAaj.v'O(0 

.\3  ip  | - I-  O .0  ^ .«  « 
I-  00  (-  >0  t»  (£1 


a 

. S'  N 10  O ip  O 
1 co  -•  <0  ©>  <0  & <© 

»«0!I0(0  3)N* 


3 :£*  = <* 

J-jui^ac  ttaxa: 


3£isx<!Dy^5' 


«?■  tA  m?  N m?  I«  — * CTi 

10  lA  lA  ® I-  ^ I-  ~ 

" r»  10  JO  » op  it 

^'-'-''-OOCOO 


r5-,oja.^.r2X3; 


5S?s<a5sia 


US-2 

S-3.2 

u t2  u 
o ^ a> 

* 


m 


The  median  values  or  RFfs's  ) for  winter  and  summer  are  shown  in  Figure  2 
for  periods  up  to  18  hours.  The  median  relative  frequency,  HF(S),  of  the  most 
frequently  occurring  category,  GE  5 miles  was  0.7917  in  winter  and  0.8634  in 
summer.  Because  this  is  a frequently  occurring  category  there  were  mar.y  long 
sequences  of  successes.  The  median  conditional  relative  frequencies  for  GE  5, 
given  in  Table  6 and  shown  as  x's  in  Figure  2,  increase  in  magnitude  for  13  hours 
in  winter  and  15  hours  in  summer.  They  never  vary  significantly,  that  is,  by  more 
than  0.001  for  the  next  few  hours  after  hour  15,  therefore  the  estimated  conditional 
probabilities  are  regarded  as  constant  aft»r  hour  15.  The  conditional  relative 
frequencies  of  the  less  frequently  occurring  categories  are  more  variable  as  ex- 
pected but  they  never  depart  from  the  hour  15  conditional  relative  frequencies  by 
more  than  0.  023. 

Sample  relative  frequencies  of  x consecutive  successes,  RF(S^),  obtained 
from  the  data  sample,  are  given  in  Tables,  7,  8,  9,  and  10.  All  of  the  relative 
frequencies  for  hours  1 through  70  are  shown  in  Figures  3,  4,  5,  and  6. 
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Figure  2.  Relative  Frequencies 
of  Success,  Given  x Hours  of 
Consecutive  Successes  Have  Occurred, 
in  Winter  (a)  and  Summer  (b).  The 
curves  were  subjectively  drawn. 

The  dashed  portions  of  the  curves  are 
hased  on  fewer  than  30  cases 
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Table  10.  Relative  Frequencies  of  x Consecutive  Successes.  RFfSx),  Obtained  From  the  Data  Sample  When  LE  0.25  Miles 
Visibility  in  Winter  and  LE  1 Mile  in  Summer  is  Considered  a Success.  Median  values  are  identified  with  asterisks 
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Figure  3.  Relative  Frequencies 
of  x Hours  of  Consecutive  Successes 
in  Winter  (dots)  and  in  Summer  (X's) 
When  GE  10  Miles  Visibility  is  Re- 
garded as  a Success.  The  solid 
lines  are  solutions  to  Eqs.  (3)  and 
(6)  for  winter  and  the  dashed  lines 
are  for  summer  (see  text) 
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Figure  4.  Relative  Frequencies 
of  x Hours’  of  Consecutive  Successes 
in  Winter  (dots)  and  in  Summer  (X's) 
When  GE  5 Miles  Visibility  is 
Regarded  as  a Success.  The  solid 
lines  are  solutions  to  Eqs.  (3) 
and  (6)  for  winter  and  the  dashed 
lines  are  for  summer  (see  text) 
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Figure  5.  Relative  Frequencies 
of  x Hours  of  Consecutive  Successes, 
in  Winter  (dots)  and  in  Summer  (X's) 
When  LE  3 Miles  Visibility  is 
Regarded  as  a Success.  The  solid 
lines  are  solutions  to  Eqs.  (3) 
and  (6)  for  winter  and  the  dashed 
lines  are  for  summer  (see  text) 
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Figure  6.  Relative  Frequencies  of  x Hours  of 
Consecutive  Successes,  in  Winter  (dots)  and  in 
Summer  (X's),  When  LE  0.25  Miles  Visibility 
in  Winter  and  LE  1 Mile  in  Summer  is  Regarded 
as  a Success.  The  solid  lines  are  solutions  to 
Eqs.  (3)  and  (6)  for  winter  and  the  dashed  lines 
are  for  summer  (see  text) 
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4.2  Mudrlnl 


The  probability  of  a sequence  of  x hours  of  successes  is  sometimes  estimated 
with  the  following  first  order  Markov  chain: 

F(Sx>  * F(S)  [IMSlSj))*'1  (3) 

where  Pis'Sj)  is  the  probability  of  a success  given  a success  has  occurred  and 
x equals  the  number  of  hours. 

The  relative  frequencies  HK(SI  and  RFfS^Sj),  obtained  from  the  data,  are  given 
in  the  first  two  columns  in  Tables  2 to  5.  The  estimated  conditional  probabilities  . 
Pis’Sj),  shown  in  Table  6.  were  used  to  test  Fq.  (31.  The  model  fit  the  observed 
values,  within  a few  percent,  for  the  first  few  hours  but  there  were  large  differ- 
ences between  the  model  estimates  and  corresponding  sample  relative  frequencies 
when  probabilities  of  sequences  of  successes  longer  than  a few  hours  were  esti- 
mated. Figures  3 to  6 illustrate  differences  between  model  estimates  and  observed 
relative  frequencies  when  the  nine  station  median  relative  frequencies  are  used  to 
represent  the  sample  values.  These  figures  illustrate  the  failure  of  Kq.  (3)  to 
adequately  estimate  long  sequences  of  successes.  For  example,  note  the  large 
departures  of  the  first  order  Markov  model  estimates,  labelled  (3),  from  the  sample 
relative  frequencies  of  visibility  l.K  3 miles  shown  in  Figure  5. 

To  improve  the  model  given  in  Kq.  (3)  the  following  axiomatic  expressions  can 
be  estimated  2 and  x hours  of  consecutive  successes,  respectively 

P’(S  )■  P(S)P(slSj)  <•*> 

P'(S  1-  P(S)  Pis's,).  ..  P(s's  1 x?3  (5) 

x 1 x-i 

where  l’(s'Sj)  is  the  probability  of  a success  given  that  a success  occurred  the 
previous  hour,  and  P(s'Sx_jl  is  the  probability  of  a success  following  (x-11  hours 
of  unbroken  successes. 

The  probabilities  required  for  the  solution  of  Kq.  (5)  were  estimated  from  the 
relative  frequencies  and  it  was  assumed  that  the  conditional  probabilities  were 
always  constant  beyond  IS  hours.  To  estimate  joint  probabilities  Kq.  (5)  was 

expressed  as  follows: 


'"'V  ■ 

P(S> 

P(S 1 

s*-l> 

x»2 

1 

P(S) 

P(S 1 

V P(S'Sx-l' 

3 < \ v 15  \ 

(6) 

P'(S  ) 

P(S> 

P(S 1 

S,1 Pfs's,.!  I Pfsls,.))  *''5 

xS  16 

1 

Curves  were  drawn  for  the  points  in  Figure  2 and  estimates  of  P(S^S.)  for  use 
in  Kq.  (6)  were  obtained  from  the  curves.  These  values  are  given  in  Table  6 in  the 
rows  labelled  PtS'S^).  With  the  exception  of  the  low  visibility  category,  the  con- 
ditional relative  frequencies  increase  fairly  steadily  but  at  a slower  rate  as  x 
approaches  15  hours.  As  stated  earlier,  the  conditional  relative  frequencies  remain 
almost  constant  after  15  hours.  Because  the  low  visibility  category  occurs  less  than 
2%  of  the  time,  the  shape  of  the  curves  is  uncertain.  The  curves  are  dashed 
where  relative  frequencies  arc  based  on  less  than  20  cases  to  emphasize  the 
uncertainty. 

Solutions  to  Fq.  (6),  are  shown  by  the  curves  in  Figures  3 to  6.  They  are  in 
better  agreement  with  the  relative  frequencies  than  curves  based  on  Kq,  (31.  This 
must  be  the  case,  because  the  probability  estimates  for  Fq.  (6)  are  based  more 
closely  on  the  relative  frequencies.  Modeling  is  involved  in  the  smoothing  of  the 

relative  frequencies  and  in  the  assumption  that  P(s|s  I is  constant  beyond  15  hours. 

4 x 

Gringorten  simulated  probability  distributions  by  a Monte  Carlo  exercise  and 
prepared  charts  for  use  in  estimating  the  duration  of  weather  events.  These  charts 
were  used  to  estimate  conditional  probabilities  of  the  four  categories  of  visibility, 
llour-to-hour  correlation  for  this  application  of  Gringorten's  method  was  assumed 
to  be  0.  <150  in  winter  and  0.  032  in  summer.  Table  6 shows  that  the  conditional 
probability  estimates  obtained  bv  Gringorten's  method  are  usually  lower  than  those 
obtained  from  the  data.  This  is  consistent  with  expectations  because  they  represent 
duration  probabilities,  not  persistence  probabilities.  One  shortcoming  of  the  method 
is  that  it  is  graphical.  It  is  difficult  to  estimate  the  probabilities  from  the  charts. 

A promising  analytical  method  described  in  the  treatise  by  Keilson  and  Boss'  needs 
further  development  before  it  can  be  applied  to  this  problem.  In  the  absence  of 
conditional  relative  frequencies  obtained  from  large  data  samples,  Gringorten's 
method  can  provide  suitable  estimates  of  the  probabilities  required  for  the  solution 
of  Fq.  (6).  Because  a large  sample  of  data  was  available  for  this  study,  a smooth 
subjective  fit  to  the  relative  frequencies  was  used  to  estimate  the  required  proba- 
bilities. 

Table  1 1 summarizes  some  of  the  information  obtainable  from  Kq.  (61  and  the 
the  data  shown  in  Figures  3 to  6.  It  shows  the  number  of  hours,  \,  that  each  of 
the  four  visibility  categories  was  estimated,  and  observed,  to  »>ersist.  at  six 
probability  levels  based  on  PIS^)  and  HF  (S^),  respectively.  For  example,  line  1 
of  Table  11  shows  that  GF  10  miles  visibility  has  a climatic  occurrence  probability 

4.  Gringorten,  I.  1.  (1066)  A stochastic  model  of  the  frequency  and  duration  of 

weather  events,  .1.  Appl,  Meteor.  ^606-624. 

5.  Keilson,  .1.,  and  Boss,  II.  F.  (1075)  Passage  Time  Distributions  for  Gaussian 

Markov  (Ornstein-l’hlenbeck)  Statistical  Processes.  Selected  Tables  in 
Mathematical  Statistics  Vol,  111.  American  Mathematical  Society , 

Providence.  Bhode  (stand,  pp.233 -327, 
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of  0.  5397.  in  winter,  and  that  50%  of  the  time  GE  10  would  be  expected  to  be  ob- 
served for  at  least  2 hours.  Approximately  25%  of  the  time  GE  10  is  expected 
to  be  observed  for  at  least  17  hours.  GE  10  is  expected  to  persist  for  37,  52,  "*72, 
and  > 72  hours;  10%,  5%,  1%.  and  0.  1%  of  the  time,  respectively,  as  can  be  seen 
from  the  solid  curve  based  on  Eq.  (6)  shown  in  Figure  3.  The  corresponding 
observed  values  are  shown  in  parentheses  in  the  table. 

The  values  given  in  Tables  11  and  12  are  for  eastcoast  stations.  They  apply 
elsewhere  only  to  the  extent  that  the  probability  of  the  event  and  the  hour-to-hour 
correlation  is  the  same. 

Table  12  summarizes  some  of  the  information  obtainable  from  the  following 
equation. 


P(S  Is  ’ 

X 


p(s,tl' 

— w 


(7) 


This  equation  is  used  to  estimate  the  conditional  probability  of  observing  a sequence 
of  x hours  of  a weather  category.  It  can  be  used  to  answer  questions  such  as; 
given  that  the  visibility  is  GE  10  miles,  how  many  hours  will  it  be  before  there  is 
a 50%  probability  that  the  sequence  of  GE  10  visibilities  will  be  broken.  The  un- 
conditional probability  P(S)  is  assumed  to  be  known  and  P(S^  ! S)  must  equal  0.  50. 
Substituting,  for  example,  the  winter  unconditional  probability  of  observing  GE  10 
miles,  0.5397,  into  Eq.  (7),  it  becomes 


P(SxM>  = (0.50)  (0.5397)=  0.2699. 


(7  a) 


Solutions  to  Eq.  (6)  for  P(Sx+  j)  for  GE  10  are  shown  in  Figure  3.  It  can  be  seen 
that  P(S  j)  = 0.2699  when  x+ 1 = 15  hours,  therefore,  GE  10  is  expected  to  per- 
sist for  more  than  14  hours  about  50%  of  the  time.  The  dots  in  Figure  3 show 
values  of  RF(S^),  for  winter.  It  can  be  seen  that  RFtS^)  = 0.2699  when  x=15  hours. 
At  this  point  the  model  and  the  data  are  in  good  agreement. 

Table  12  shows  that  from  Eq.  (7),  when  GE  10  is  observed  in  winter,  it  is 
expected  to  persist  14,  29,  49,  65,  '■7  2,  and  >72  hours;  50%,  25%,  10%,  5%,  1%, 
and  0.  1%  of  the  time,  respectively.  Corresponding  observed  values  are  shown  in 
parentheses. 

The  following  equation  can  be  used  to  estimate  how  long  a sequence  of  successes 
is  expected  to  persist,  given  that  the  sequence  has  just  begun 


P(S  1 FS) 
X 


P(FSx>,) 

~p<  rer 


(8) 


The  F preceding  the  S denotes  a failure  followed  by  a success. 
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Table  11.  The  Minimum  Number  of  Hours  that  Each  Visibility  Category  Was  Estimated  to  Persist,  at  Selected 
Probability  Levels  (see  text).  Observed  values  are  shown  in  parentheses 
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No  data, 


Equation  (8)  always  yields  smaller  values  than  Eq.  (7),  unless  the  process  is 
first-order  Markov  in  which  case  the  values  are  identical,  A table  of  values  based 
on  solutions  to  Eq.  (8)  was  not  prepared,  but  sufficient  information  is  included  in 
this  report  to  prepare  such  a table. 


5.  RUNS 


5. 1 Observed 


Another  way  of  examining  persistence  is  to  consider  the  number  of  runs  of 
exactly  x hours  in  length,  that  is,  n(FSjE),  n(ES,,E). ,,,  niES^E).  The  relative 
frequency  of  runs  is  given  by  the  expression: 


RE(ES  F)  = 


n(ES  F) 


N-nu-n 


n(FS  F) 
x 


<!)) 


where  niFS^F)  is  the  observed  number  of  runs  of  exactly  x hours  in  length  and  N 


is  the  total  number  of  hours  in  the  data  sample. 

The  observed  number  of  runs,  based  on  28,080  hours  of  winter  observations, 
ami  28,704  hours  of  summer  observations,  at  each  of  the  nine  stations  is  given,  for 
selected  hours,  in  Tables  12  to  16.  The  median  values  are  indicated  with  asterisks. 
Although  the  frequencies  are  based  on  more  than  28,000  observations  at  each  sta- 
tion and  season,  there  are  large  sampling  variations.  Some  examples  found  in 
Table  12,  in  winter,  are:  EGA  had  only  48  runs  of  2 hours  of  GE  10  while  the 
nearby  station  JFK  had  80;  Fill,  had  only  2 runs  of  24  hours  hut  5 runs  of  20  hours; 
and  Hit'  had  17  runs  of  5 hours  hut  22  runs  of  6 hours. 

To  model  the  runs  it  is  assumed  that  one  good  model  can  estimate  runs  at  any 
of  the  nine  stations,  at  least  as  well  as  a 12 -year  data  sample. 

A model  was  considered  that  is  very  similar  to  Eq.’  (61.  This  model  requires 


estimates  of  the  conditional  probabilities,  P(S|FS^).  Relative  frequencies  of  success 


given  a failure  and  x hours  of  successes  were  determined  from  the  data  with  the 
following  expression: 


RF(S'FSx> 


irrrsn- 


( 10) 


These  relative  frequencies,  a selection  of  which  are  given  in  Tables  17  to  20,  were 
used  to  obtain  the  required  conditional  probabilities. 
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Table  13.  Observed  Number  of  Runs,  n(FSxF)  of  x Hours  in  Length  Observed  in  the  Data  Sample  and  Estimated 


Table  14.  Observed  Number  of  Runs.  n(FSxF),  of  x Hours  in  Length  Observed  in  the  Data  Sample  and  Estimated 
Through  the  Use  of  Eq.  (11)  When  GE  5 Miles  Visibility  is  Considered  a Success.  Median  values  are  identified 
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Observed  Number  of  Runs,  n(FSxF),  of  x Hours  in  Length  Observed  in  the  Data  Sample  and  Estimated 


Table  17.  Relative  Frequency  of  Success  Given  a Failure  and  x Hours  of  Success,  RF(s'fSxI,  Obtained  From  the 
Data  Sample  When  GE  10  .Miles  Visibility  is  Considered  a Success.  Median  values  are  identified  with  asterisks 
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Table  19.  Relative  Frequency  of  Success  Given  a Failure  and  x Hours  of  Success.  RF(S!fSx),  Obtained  From 
the  Data  Sample  When  LE  3 Miles  Visibility  is  Considered  a Success.  Median  values  are  identified  with 
asterisks 
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5.2  Modeled 

The*  probability  of  a run  of  exactly  x hours  in  length.  P(FSxF),  is  the  probability 
that  there  will  be  a failure  followed  by  x successes  followed  by  another  failure.  This 
might  be  estimated  as  follows,  for  runs  of  length  1.  2.  and  x hours,  respective- 
ly: 

P'  (FSxF>  ■ P(F)  P(S!F>  P(F|FSx) 

P'  (FS^F)  » P(F»  IMS1  F)  P(s|fSx_,)  P< F 1 FS^ ) 

P*  <FSxF)  » IMF)  P(S’f>  P(sIfSj)  P(S  1 FS,,1. . . IMS  1 FS^,)  P(F,FSx)  x?  3 J 

where  1'(S|f)  is  the  estimated  probability  of  a success  given  that  a failure  occurred 
the  previous  hour,  PiS^FS.l  is  the  estimated  probability  of  a success  given  that  a 

• a I 

success  occurred  and  a failure  occurred  2 hours  earlier P(F' FSx>  is  the 

estimated  probability  of  a failure  given  that  x successes  occurred  the  previous  x 
consecutive  hours  and  a failure  occurred  x*  1 hours  earlier.  The  unconditional  and 
conditional  probabilities  can  be  estimated  from  the  relative  frequencies  but  very 
large  samples  of  data  are  required  to  obtain  statistically  stable  relative  frequencies 
of  long  runs,  because  they  are  rare  events. 

The  points  plotted  in  Figure  7 show  the  nine-station  median  relative  frequencies 
of  success  given  a failure  and  x hours  of  successes  have  occurred.  The  median 
values  are  given  in  Table  21.  Smooth  curves  were  subjectively  drawn  through  the 
points  in  Figure  7.  The  probabilities  required  for  the  solution  of  Fq.  (Ill  were 
estimated  from  these  curves. 

Table  21  shows  values  of  P(S 1 FSx>  that  were  estimated  from  the  curves  shown 
in  Figure  7.  The  conditional  probabilities  always  Increase  for  at  least  8 hours  and 
most  of  the  values  increase  for  at  least  12  hours. 
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Figure  7.  Relative  Frequencies  of 
Success  Given  Exactly  x Hours  of 
Consecutive  Success  Have  Occurred, 
in  Winter  (a)  and  Summer  (b).  The 
curves  were  subjectively  drawn.  The 
dashed  portions  of  the  curves  are 
based  on  fewer  than  30  cases 


The  values  found  in  Table  21  were  used  to  solve  ICq.  (11).  By  substituting 
P'fFS^F)  from  Eq.  (11)  for  RFfFS^F)  in  Eq.  (0)  the  following  expression  is  obtained 
for  estimating  n(FS  F) 

ftfFS^F)  « F'(FSxF)N  . (12) 

Solutions  to  Eq.  (12)  are  given  in  Tables  13  to  16.  The  agreements  between  the 
observed  number  of  runs  and  those  calculated  from  Eq.  (12)  are  very  good.  It 
should  be  understood  that  this  is  not  an  independent  test  of  Eq.  (12)  hut  rather  a 
subjective  fitting  to  the  data  to  obtain  conditional  probabilities  and  an  objective 
method  for  finding  the  desired  probability  estimates. 
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6 KKXKKFM.K 


6.1  OlwntMl 

The  relative  frequency  of  the  recurrence  of  a success  * hours  later  Riven  that 
a success  occurred.  HF(S  ^S).  can  he  determined  from  the  data  by  dividing  the 
number  of  occurrences  of  successes  spaced  * hours  aoart.  n(SS^).  bv  the  total 
number  of  successes.  n(S).  that  is, 

n(SSj) 

KK(Sjs).  -nTST--  U3) 

Conditional  recurrence  relative  frequencies  based  on  13  years  of  hourly 
observations  taken  at  each  of  the  nine  stations  are  given,  for  selected  hours,  in 
Tables  22  to  25.  The  median  values  for  each  season  are  also  given  in  the  tables 
and  plotted  in  Figures  8 to  U.  The  curves  for  summer  show  a pronounced  24- 
hourly  period. 


6 


6.2  Modeled 

McAllister"  proposed  an  expression  of  the  form 

.b 


V P(Ste*»  * l 


-a«- 


<14> 


for  estimating  recurrence  probabilities  of  cloud  cover.  He  used  a 0.  263  and 

7 

b • 0.  632  as  the  best  estimates  of  the  parameters.  Gringorten  showed  that  Fq. 
(HI  yields  probability  estimates  very  close  to  those  obtained  from  the  bivariate 
normal  distributions  if  the  parameter  b is  fixed  at  0.  620  and  a is  allowed  to  vary 
with  the  climatic  frequency  of  the  event  and  the  basic  persistence  of  the  element. 


6.  McAllister.  C.  K.  (l"60>  Cloud-cover  recurrence  and  diurnal  variation, 

.1.  Vppl.  Meteor.  8:7  60-777, 

7.  Gringorten.  1.  I.  (107  1)  Modeling  conditional  probability,  .1.  Appl.  Meteor. 

10:646-657. 
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Figure  8.  Relative  Frequencies  of  a 
Success.  GE  10  Miles  Visibility,  l 
Hours  Later  Given  a Success  Has 
Occurred,  in  Winter  (dots)  and  in 
Summer  (X's).  The  solid  curve  is  the 
solution  to  Eq.  (15)  with  a'  = 0.  158 
for  winter,  and  the  dashed  curve  is 
for  summer  with  a'  = 0.  163 
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Figure  9.  Relative  Frequencies  of  a 
Success,  GE  5 Miles  Visibility,  l 
Hours  Later  Given  a Success  Has 
Occurred,  in  Winter  (dots)  and  in 
Summer  (X's).  The  solid  curve  is  the 
solution  to  Eq.  (15)  with  a'  * 0.  224 
for  winter,  and  the  dashed  curve  is  for 
summer  with  a'  = 0.  175 
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Figure  10.  Helative  Frequencies  of 
a Success,  LE  3 Miles  Visibility,  i 
Hours  Later  Given  a Success  Has 
Occurred,  in  Winter  (dots)  and 
in  Summer  (X's).  The  solid  curve  is 
the  solution  to  Eq.  (15)  with 
a1  » 0.280  for  winter,  and  the  dashed 
curve  is  for  summer  with  a'  * 0.  283 


Figure  11.  Relative  Frequencies  of 
a Success,  LE  0.  25  Miles  Visibility 
in  Winter  and  LE  1 Mile  in  Summer, 

*>  Hours  Later  Given  a Success  Has 
Occurred,  in  Winter  (dots)  and  in 
Summer  (X's).  The  solid  curve  is 
the  solution  to  Eq.  (15)  with 
a'  » 0.425  for  winter,  and  the  dashed 
curve  is  for  summer  with  a'  ■ 0.  524 


Eq.  (14)  was  modified  to:  (1)  eliminate  any  possibility  of  obtaining  probability 
estimates  greater  than  one:  (2)  take  into  account  diurnal  periods  in  weather  events; 
and  (3)  obtain  climatic  estimates  independent  of  the  (nitial  hour.  The  new  equation 
was  expressed  as  follows: 

P(S<!s>  - ^(l-e*a'ib)  <YZ)  + e"*'^  (W)  j 05) 

where  YZ  is  the  temporal  average  of  the  product  of  the  two  probabilities,  that  is, 

23 

,,6*> 

Where  Y and  7.  are  probabilities  or  success  at  time  t and  tti  hours,  respectively; 
and,  W is  the  temporal  average  of  the  lower  of  each  pair  of  probabilities,  that  is, 

23 

VJ  = £ W.  (15b) 

**  i-0  1 

where  W^aY^  or  whichever  is  smaller. 

Table  26  shows  that  there  is  a pronounced  diurnal  period  in  visibility  occur- 
rences. The  hourly  climatic  frequencies  of  the  events  were  substituted  into  Eq.  (15) 
using  i » 12  hours  and  b » 0.  620  and  the  equation  was  solved  to  find  the  parameter 
a'.  The  a'  values  are  given  in  Table  27. 

Eq.  (15)  was  solved  for  lags  from  1 to  7 1 hours  using  the  a'  values  given  in 
Table  27.  The  resulting  curves  are  shown  in  Figures  8 to  11.  The  fits  to  the 
summer  relative  frequencies  are  excellent  for  all  visibility  categories.  The  fits 
to  the  winter  values  are  also  believed  to  be  acceptable  for  most  purposes. 


7.  REMARKS 

Relative  frequencies  of  persistence,  runs,  and  recurrence  of  visibility  along 
the  east  coast  of  the  United  States  between  New  York  and  North  Carolina,  presented 
in  this  paper,  are  based  on  more  than  250,000  hourly  observations  taken  in  winter 
and  a similar  number  taken  in  summer.  They  are  believed  to  be  good  approxima- 
tions of  the  true  probabilities. 

Models  are  presented  for  use  in  estimating  joint  and  conditional  probabilities. 
The  estimates  are  usually  in  good  agreement  with  the  relative  frequencies  when  the 
parameters  are  carefully  chosen.  However,  the  best  parameters  for  the  Central 
East  Coast  area  of  the  United  States  may  not  be  the  best  for  other  geographical 
areas.  Future  studies  will  be  extended  to  other  areas  and  to  improving  the  models. 
Other  weather  elements  are  under  study  at  the  present  time. 
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Table  26.  Nine-Station  Median  Relative  Frequency  of  Each  Visibility 
Category  for  Each  Hour  of  the  Day 


Hour 

(LST) 

Winter  Category 
S.25  S3  s 5 

S 10 

Summer  Category 
si  <3  S5 

S 10 

00 

.0162 

. 134 

.808 

.577 

. 007  52 

mm 

.489 

01 

.0179 

. 138 

.805 

.560 

. 00920 

rUcS 

.876 

.459 

02 

.0205 

V,  *S 

m ' : m 

.561 

.0142 

.426 

03 

.0231 

. 138 

.813 

.556 

.0209 

Mem 

.818 

.389 

04 

.0214 

. 138 

.807 

.543 

. 0359 

. 151 

.777 

.364 

05 

.0273 

. ISO 

.806 

.530 

.0602 

.207 

.705 

.303 

06 

.0248 

. 156 

.793 

.508 

.0619 

.231 

. 685 

.308 

07 

.0273 

.212 

.726 

.424 

.0518 

.211 

. 691 

.324 

08 

.0273 

.233 

. 689 

.378 

.0251 

. 157 

.756 

.348 

09 

.0188 

.238 

. 688 

.386 

.0109 

. 120 

.828 

.385 

10 

.0162 

.217 

.716 

.425 

.00836 

.097  0 

. 854 

.454 

11 

.0120 

. 185 

.761 

.493 

. 00502 

.0694 

. 895 

. 515 

12 

.0102 

. 173 

.783 

.544 

.00418 

.0535 

.912 

.548 

13 

. 00940 

. 155 

.810 

.572 

.00418 

.0502 

.918 

. 574 

14 

.00855 

. 144 

.820 

. 589 

.00334 

. 0502 

.927 

.592 

15 

. 00940 

. 149 

.820 

. 600 

. 004  18 

.0485 

.926 

.598 

16 

.00120 

. 148 

.816 

.591 

.00418 

.0460 

.928 

.598 

17 

. 00855 

. 147 

.815 

.552 

.00502 

.0426 

.932 

.604 

18 

.0102 

. 120 

.838 

. 562 

. 00502 

.0465 

.915 

.599 

19 

.0102 

. 116 

. 843 

. 579 

.00585 

.0602 

.901 

.553 

20 

. 00940 

. 119 

. 843 

.592 

. 00585 

.0485 

.912 

.528 

21 

.0120 

. 127 

.832 

.592 

.00418 

.0426 

.914 

.551 

22 

.0154 

. 128 

.827 

.588 

.00418 

.0493 

.909 

.529 

23 

.0154 

. 138 

.823 

.579 

.00585 

.0518 

.903 

.520 

Table  27.  The  "a'"  Values  Used  in  Eq.  (15)  to  Find  the  Curves 
Phown  in  Figures  8 to  11 


Visibility  Category 

Season 

GE  10 

GE  5 

LE  3 

LE  0.25  (Winter) 
LE  1.  00  (Summer 

Winter 

0.  158 

0.  224 

0.260 

0.425 

Summer 

0.  163 

0.  17  5 

0.263 

0.524 
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